## RANGE-BASED CACHE CONTROL SYSTEM AND METHOD

| This application is a continuation of Application No.  | 09/552,399, filed | April 19, | 2000, |
|--------------------------------------------------------|-------------------|-----------|-------|
| Patent No. 6,601,137, herein incorporated by reference |                   |           |       |

## **BACKGROUND OF THE INVENTION**

# Field of the Invention

The present invention relates to disk drive performance features and more particularly to a disk drive having a cache control system for improving the disk drive's response to host commands.

## **Description of the Prior Art**

A host computer stores and accesses data on a disk drive by issuing commands to the disk drive over a standardized interface. The smallest indivisible data unit addressable on a disk is a logical block or disk sector, typically of 512 bytes, and each such disk sector is assigned a logical block address (LBA). When the host computer sends a command to the disk drive, the nature of the command is specified, e.g., read or write, along with a start LBA and a count specifying the number of contiguous sectors to be transferred.

Existing disk drives typically have a semiconductor cache memory for temporarily storing disk data that is likely to be requested by a host computer. The response time latency for storing and accessing data in a semiconductor memory is much smaller than the response time latency for mechanically storing and accessing data stored on a rotating disk. In existing disk drives, if an entire LBA range of a host command is not found, or if the first LBA of the host command is not buried within a segment or range of LBA's stored in the cache memory, then a new cache segment is configured for responding to the host command. Accordingly, although the LBA range of the host command may overlap with the LBA range of a segment of the cache memory, that segment is essentially useless in responding to the host command.

Accordingly, there exists a need for a disk drive having a cache memory that may be configured to advantageously use existing cached data to respond to a host command. The present invention satisfies these needs.

## SUMMARY OF THE INVENTION

The invention may be embodied in a disk drive, and related method, having a cache memory and a cache control system. The cache memory has a plurality of memory clusters for caching disk data of disk sectors identified by logical block addresses. The cache control system has a tag memory and a scan engine. The tag memory has a plurality of tag records. Each tag record defines a variable length segment of the memory clusters for caching disk data for a range of logical block addresses and indicates the range of logical block addresses. The scan engine is only usable for scanning the tag records. The scan engine includes means for receiving a range of logical block addresses associated with a host command, means for reading the ranges of logical block addresses defined by the tag records, means for comparing the range of logical block addresses indicated in the tag records, and means for providing scan results, based on a comparison by the means for comparing, indicating overlap between the logical block address range associated with the host command and the ranges of logical block addresses indicated in the tag records.

In a more detailed feature of the invention, the means for comparing may further determine whether a first logical block address of a range of logical block addresses associated with a host command is within the ranges of logical block addresses indicated in the tag memory records. The means for providing scan results may indicate the tag records, determined by the means for comparing, having a range including the first logical block address. The means for providing scan results may indicate whether an entire logical block address range, a portion of the logical block address range, or none of the logical block address range associated with a host command is within the ranges of logical block addresses in the tag memory records. The means for providing scan results also may indicate whether the logical block address range associated with a host command is buried within a range of the ranges of logical block addresses in the tag records or whether only a portion of the logical block address range associated with a host command is within a range of the ranges of logical block addresses in the tag memory records. Further, the means for providing scan results may indicate whether the portion includes the beginning or the end of the logical address range associated with the host command.

In another more detailed feature of the invention, the scan engine may accept a scan command, associated with a host command, from a microprocessor, a host writable control store

| I  | and a host command decoder. The scan engine may include means for arbitrating between scan    |
|----|-----------------------------------------------------------------------------------------------|
| 2  | commands from the microprocessor, the host writable control store and the host command        |
| 3  | decoder.                                                                                      |
| 4  | BRIEF DESCRIPTION OF THE DRAWINGS                                                             |
| 5  | The accompanying drawings illustrate embodiments of the present invention and,                |
| 6  | together with the description, serve to explain the principles of the invention.              |
| 7  | FIG. 1 is a block diagram of a disk drive having a cache control system for scanning a tag    |
| 8  | memory for overlap between ranges of cached data and a host command range, according to the   |
| 9  | present invention.                                                                            |
| 10 | FIG. 2 is a block diagram showing the cache control system of FIG. 1, having the tag          |
| 11 | memory and a scan engine, according to the present invention.                                 |
| 12 | FIG. 3 is a block diagram showing a table of tag records in the tag memory of the cache       |
| 13 | control system of FIG. 1, for defining segments of memory clusters for caching ranges of disk |
| 14 | data.                                                                                         |
| 15 | FIG. 4 is a data structure for a tag record in the table of tag memory records of FIG. 3.     |
| 16 | FIG. 5 is a data structure for a status and control flag in the data structure of FIG. 4.     |
| 17 | FIG. 6 is a data structure for a cluster control block for use by the tag records of FIG. 3.  |
| 18 | Fig. 7 is a block diagram of the scan engine of the cache control system of FIG. 2.           |
| 19 | Fig. 8 is a block diagram of a host command logical block address (LBA) range versus a        |
| 20 | tag record LBA range, showing a full cache hit.                                               |
| 21 | Fig. 9 is a block diagram of a host command LBA range versus a tag record LBA range,          |
| 22 | also showing a full cache hit.                                                                |
| 23 | Fig. 10 is a block diagram of a host command LBA range versus a tag record LBA range,         |
| 24 | showing a full cache hit buried.                                                              |
| 25 | Fig. 11 is a block diagram of a host command LBA range versus a tag record LBA range,         |
| 26 | showing a partial cache hit head.                                                             |
| 27 | Fig. 12 is a block diagram of a host command LBA range versus a tag record LBA range,         |
| 28 | showing a partial cache hit head buried.                                                      |
| 29 | Fig. 13 is a block diagram of a host command LBA range versus a tag record LBA range,         |
| 30 | showing a partial cache hit mid.                                                              |

| I  | Fig. 14 is a block diagram of a nost command LBA range versus a tag record LBA range,            |
|----|--------------------------------------------------------------------------------------------------|
| 2  | showing a partial cache hit tail.                                                                |
| 3  | Fig. 15 is a block diagram of a host command LBA range versus a tag record LBA range,            |
| 4  | showing a cache miss.                                                                            |
| 5  | Fig. 16 is a block diagram of a host command LBA range versus a tag record LBA range,            |
| 6  | showing a cache miss sequential head.                                                            |
| 7  | Fig. 17 is a block diagram of a host command LBA range versus a tag record LBA range,            |
| 8  | showing a cache miss sequential tail.                                                            |
| 9  | Fig. 18 is a block diagram of a host command LBA range versus tag record LBA ranges,             |
| 10 | showing multiple cache hits.                                                                     |
| 11 | Fig. 19 is a flow chart showing a method for scanning tag records for overlap between a          |
| 12 | host command range and ranges of cached data.                                                    |
| 13 | <b>DETAILED DESCRIPTION</b>                                                                      |
| 14 | With reference to FIG. 1, a disk drive 10 comprises a cache control system 12, and a             |
| 15 | cache memory 14 having a plurality of memory clusters 46 for caching disk data stored in sectors |
| 16 | (not shown) on disks of a disk assembly 38. Conventionally, the disk sectors are identified by   |
| 17 | logical block addresses (LBAs). The cache control system comprises a tag memory 22 and a         |
| 18 | scan engine. The tag memory 22 is embedded in the cache control system 12 and has a plurality    |
| 19 | of tag records 40 for defining variable length segments of the memory clusters 46 for caching    |
| 20 | disk data for ranges of LBAs. The scan engine 26 is also embedded within the cache control       |
| 21 | system 12 and thereby configured only for use in scanning tag records 40. The scan engine 26     |
| 22 | includes a comparator for comparing a host command range with the tag record LBA ranges in       |
| 23 | indicate overlap between the ranges. The cache control system 12 is effective in exploiting      |
| 24 | existing cached data for LBA ranges overlapping with the host command LBA range.                 |
| 25 | The disclosures of the following three U.S. Patent Applications are hereby incorporated          |
| 26 | by reference: application serial number 09/552,404, filed on April 19, 2000, now patent number   |
| 27 | 6,553,457 titled TAG MEMORY DISK CACHE ARCHITECTURE; application serial number                   |
| 28 | 09/552,407, filed on April 19, 2000, titled CLUSTER-BASED CACHE MEMORY                           |
| 29 | ALLOCATION; and application serial number 09/552,402, filed on April 19, 2000, titled            |
| 30 | CACHE CONTROL SYSTEM AND METHOD HAVING HARDWARE-BASED TAG                                        |

RRF: A1302app.doc 7/25/03

#### RECORD ALLOCATION.

| 3    | and a host interface 18. The host interface 18 receives host commands from a host 20, such as a  |
|------|--------------------------------------------------------------------------------------------------|
| 4    | personal computer, and transfers disk data between the disk drive 10 and the host 20. The host   |
| 5    | commands identify the disk data using a start logical block address (LBA) and a count specifying |
| 6    | the number of contiguous sectors to be transferred. The cache memory 14 caches the disk data     |
| 7    | under the direction of the cache control system 12 and the microprocessor 16. The                |
| 8    | microprocessor 16 operates under firmware control and manages the operation of the disk drive    |
| 9    | 10 and assists hardware elements under specific conditions. The cache memory 14 is random        |
| 10 - | access memory, typically 2 megabytes (MB). Generally, the larger the cache memory 14, the        |
| 11   | better the performance of the disk drive 10 in responding to host commands. The cache control    |
| 12   | system 12 includes the aforementioned tag (random access) memory (RAM) 22, the                   |
| 13   | aforementioned scan engine 26, and a results register 30.                                        |
| 14   | The disk drive 10 also includes a disk channel 36 and the aforementioned disk assembly           |
| 15   | 38. The disk assembly 38 includes a hard disk platter that is organized into the disk sectors,   |
| 16   | typically of 512 bytes plus redundancy bytes for error correction, which are individually        |
| 17   | addressable using a logical block address (LBA). The disk channel 36 performs conventional       |
| 18   | encoding and decoding of data written to and read from the disk.                                 |
| 19   | The cache control system 12 is shown in more detail in FIG. 2. The cache control system          |
| 20   | 12 includes the tag memory 22, a CCB memory 24, and the scan engine 26. The tag memory 22        |
| 21   | is a static random access memory (SRAM) structure, which is preferably embedded in an            |
| 22   | integrated controller chip, having a table of tag or segment records. The CCB memory 24 is also  |
| 23   | preferably embedded SRAM having a plurality of records or CCBs (cluster control blocks) 34.      |
| 24   | The tag memory 22 may be accessed by the microprocessor 16, the scan engine 26 and a             |

With reference again to FIG. 1, the disk drive 10 further includes a microprocessor 16,

The tag memory 22 may be accessed by the microprocessor 16, the scan engine 26 and a host writable control store (HWCS) 28, and may be updated by the microprocessor 16 and the HWCS 28. The scan engine 26 is coupled to the host interface 18 and receives host commands and scans the tag memory 22 for the LBA ranges associated with a host command. The scan engine 26 places the scan results in a results register 30 or, if servicing the host command further requires intervention by the microprocessor, the HWCS 28 places the command in a command queue 32. The scan engine is described in more detail below. The command queue 32 has a read

```
miss queue and a write command first-in first-out (FIFO) queue. If a host command may be
     responded to by the cached data referenced in the tag memory 22, then the HWCS 28 manages
 2
     the response to the host command, otherwise the microprocessor 16 may assist with the response.
 3
     Thus, the HWCS 28 offloads cache tasks from the microprocessor 16 enabling response to host
 4
     commands for data already in the cache memory 14 without microprocessor intervention.
 5
             The tag memory 22 is described in more detail with reference to FIGS. 3 and 4. The tag
 6
     memory 22 has a plurality of the tag records 40 that define segments, 42 and 44, of the memory
 7
     clusters 46 within the cache memory 14. Typically, the tag memory 22 may have 32 or 64
 8
     records dedicated to defining variable length segments. Other tag memory records (not shown)
 9
     may be dedicated to single block transfers for caching small data elements stored within one
10
     memory cluster 46 that are repeatedly accessed by the host 20. The cache memory 14 is divided
11
     into sectors 48. The cache sectors 48 are bunched into consecutively numbered groups or
12
     clusters. Each cluster 46 has a particular cluster number. Preferably, each cluster 46 has 16
13
     cache sectors 48, although the number of sectors 48 in each cluster 46 may be selected based on
14
15
     the size of the cache memory 14, the size of the CCB SRAM 24, and the operational
     characteristics of the host 20.
16
17
             The tag memory 22 defines the segments of the cache memory clusters 46 using the
     CCBs 34. Each tag record 40 has entries or fields (50, 52, 54, 56, 58 and 60), respectively for
18
19
     indicating the first disk LBA assigned to the corresponding segment, the number of valid sectors
     in the segment, the number of sectors allocated to the segment, the first segment CCB, the last
20
     segment CCB, and state and control flags for the segment. As shown in FIG. 6, each CCB has a
21
     pointer 62 to a next CCB in a segment or to indicate that the CCB is the last CCB in the segment.
22
     Accordingly, a tag record 40 defines a segment by recording the segment's first CCB in the first
23
     CCB entry 56. The first CCB 34 has a pointer 62 to the next or second CCB in the segment. The
24
25
     second CCB likewise has a pointer 62 to the next CCB until the last CCB in the segment. The
     last CCB has an indicator such as a null value that indicates the end of the segment. Two short
26
     exemplary segments, 42 and 44, are shown in FIG. 3. The first segment 42 is defined by the tag
27
     record number 1 to have three clusters 46. The second segment 44 is defined by the tag record
28
     number 29 to have a length of two clusters 46. The tag memory is described in more detail in the
29
     above-referenced U.S. application serial number 09/552,404, titled TAG MEMORY DISK
30
```

CACHE ARCHITECTURE. The tag records for single block transfers have entries, 50 and 60, for only the first LBA and the state and control flags. 2 The cache control system 12 (FIG. 2) also includes a free list 64 and a most-recently-3 used/least-recently-used (MRU/LRU) engine 66. The free list 64 tracks any CCB's 34 not 4 assigned to a tag record 40. Accordingly, all CCBs 34 are assigned to either a tag record 40 or to 5 6 the free list 64. The CCBs 34 and the free list 64 is described in more detail in the abovereferenced U.S. application serial number 09/552,407, titled CLUSTER-BASED CACHE 7 MEMORY ALLOCATION. The MRU/LRU engine 66 keeps track of the currency of the cached 8 data associated with each tag record 40 in the tag memory and is described in more detail in 9 above-referenced U.S. application serial number 09/552,402, titled CACHE CONTROL 10 SYSTEM AND METHOD HAVING HARDWARE-BASED TAG RECORD ALLOCATION. 11 The preferred data structure of the entries (FIG. 4) in the tag records 40 is now described. 12 The first entry 50 in a tag record 40 is a 32-bit address representing the first logical block address 13 of the segment being defined by the tag record 40. The next entry 52 in the tag record 40 is a 10-14 15 bit valid count representing the number of valid sectors in the segment. The valid count represents the valid data sectors in the cache memory 14. The next entry 54 in the tag record is a 16 10-bit allocated count representing the number of cache sectors 48 allocated to the segment. The 17 allocated count is generally equal to the valid count upon command completion except when the 18 command is prematurely aborted. The valid count is never greater than the allocated count. The 19 next entry 56 in the tag record 40 is an 8-bit first segment CCB pointer to the first CCB 34 used 20 in the segment. The next entry 58 in the tag record 40 is an 8-bit last segment CCB pointer. The 21 22 next entry 60 in the tag record 40 is a series of status and control flags for use by the scan engine 26, the microprocessor 16 and the HWCS 28. Among other things, the status and control flags 23 are used for managing tag record allocations and transfers of data between the memory clusters 24 25 46 and the host 20, and between the memory clusters 46 and the disk 38. As shown in FIG. 5, the status and control flag entry 60 includes a 2-bit control flag 70, a 26 2-bit status flag 72, and a 1-bit Q scan flag 74. The control flag 70 indicates ownership of the tag 27 record 40. Ownership of a tag record may be maintained by the microprocessor 20, the HWCS 28 28, or the scan engine 26. The status flag 72 indicates the status of the disk data stored in the 29 memory clusters 14 associated with the tag record 40. The status may be free, available, valid, or 30

7

RRF: A1302app.doc 7/25/03

- valid/dirty. The free status indicates that no valid data is associated with the tag record 40. The
- 2 available status indicates that valid data is associated with the tag record 40, but that the data may
- be discarded and the tag record 40 reused. The valid status indicates that valid data is stored in
- 4 the memory clusters 46 of the segment defined by the tag record. The valid/dirty status indicates
- a segment of memory clusters 46 having valid data that has not been written to the disk 38. All
- 6 host write data is marked as valid/dirty when it transferred into the cache memory by the HWCS
- 7 28. The Q scan flag 74 is used during a review of the tag records 40 by the MRU/LRU engine
- 8 66.
- The scan engine 26 is shown in more detail in FIG. 7. The scan engine includes the
- aforementioned comparator 76 for comparing an LBA range associated with a host command
- with LBA ranges in the tag records. A scan of the tag records 40 may be requested by the
- microprocessor 16, the HWCS 28, or a host command decoder 78. The scan engine 26 receives
- an LBA range associated with a host command from the command decoder 78, the
- microprocessor 16, or the HWCS, through a multiplexer 80. The command LBA range is loaded
- into the comparator 76 and the LBA ranges in the tag records 40 are read into the comparator 76.
- The comparator 76 compares the LBA ranges and indicates in the results register 30 any overlap
- or hits between the LBA ranges. The results register 30 indicates (from a scan of all the tag
- records 40) the type of overlap, the tag record 40 of the overlap, and a bitmap of all the segments
- that have an overlap.
- The types of overlap in the LBA ranges are defined with reference to FIGS. 8-18 and the
- 21 logical equations below. The comparison results may include: full cache hit (fhit), full hit buried
- 22 (fhtb), partial hit head (phth), partial hit head buried (phhb), partial hit mid (phtm), partial hit tail
- 23 (phtt), cache miss (miss), sequential miss head (seqh), and sequential miss tail (seqt). The start
- LBA (csa) and the end LBA (cea) of the host command LBA range are compared with the start
- 25 LBA (tsa) and the end LBA (tea) of the tag records. The comparison results are defined as:

```
1
                    phtt
                              (csa < tsa) & (cea > tsa) & (cea < tea)
                                                                         FIG. 14
                    miss
                              (csa > tea) or (cea > tsa)
2
                                                                         FIG. 15
3
                    segh
                               (csa == tea)
                                                                         FIG. 16
                    segt
                              (ceà == tsa)
                                                                         FIG. 17
4
```

In FIGS. 8-18, the LBA range overlap is shown highlighted in the host command LBA range. The scan engine 26 may also indicate multiple hits as shown in FIG. 18. The scan engine 26 provides the number of tag record(s) 40 having an LBA range overlap. On a cache miss, the scan engine 26 provides a number for a free tag record 40 for use in responding to the host command. If no free tag records are available, the scan engine 26 may indicate a free tag error.

Only one scan request may be serviced at a time. The highest scan priority is given to requests from the host command decoder 76, next to the HWCS 28, and lowest prior is given to the microprocessor 16. The HWCS 28 and the microprocessor 16 have respective done bits set in a register 82 (FIG. 7) after a requested scan is completed. If the host command decoder 78 requests a scan during a scan by the HWCS 28, the HWCS done bit is not set until the decoder scan is complete so that the decoder scan effectively overrides the HWCS scan. Scan requests by the HWCS 28 during a decoder scan are ignored.

The present invention also may be embodied in a method, shown in FIG. 19, for servicing host commands using a cache memory 14 having a plurality of memory clusters 46 for caching disk data of disk sectors identified by logical block addresses (step 190). The method includes providing a tag memory 22 having a plurality of tag records 40 (step 192). Each tag record 40 defines a variable length segment of the memory clusters 46 for caching disk data for a range of logical block addresses and indicates the range of logical block addresses. Next, a range of logical block addresses associated with a host command is received (step 194). The ranges of logical block addresses defined by the tag records 40 are read (step 196) and compared (step 198) with the range of logical block addresses associated with the host command. The scan results, based on the comparing step, are provided (step 200) indicating overlap between the logical block addresses range associated with the host command and the ranges of logical block addresses indicated in the tag records 40.

Accordingly, the cache control system 12 generates scan results that permit response to a host command using existing cached data in an overlapping LBA range. The microprocessor 16

# PATENT ATTY DOCKET K35A1302

- may then set up a tag record 40 for defining a segment of memory clusters 46 to cache the
- 2 remaining disk data not already in the cache memory 14.

RRF: A1302app.doc 7/25/03